IIIT Hyderabad at TAC 2008
نویسندگان
چکیده
This paper describes our participation at TAC 2008 in all the three tracks. For the Summarization Track we introduced two major features. First, a feature based on Information Loss if we don’t pick a particular sentence. Second, a language modeling extension that boosts novel terms and penalizes stale terms. During our post-TAC analysis we observed that a simple sentence position based summarizer leads to better short summaries than most official runs submitted this year. In the Opinion QA and Summarization Track for the rigid list questions, we have added some additional features to handle opinion expressed in the question. and for the squishy list questions in Opinion QA and Summarization Track, we leveraged on our existing Summarization engine and used a classification based approach to both finding opinionated sentences and also the polarity of the opinions. Finally, for the RTE track we explored a simple graph partition matching based approach.
منابع مشابه
IIIT Hyderabad in Summarization and Knowledge Base Population at TAC 2011
In this report, we present details about the participation of IIIT Hyderabad in Guided Summarization and Knowledge Base Population tracks at TAC 2011. we have enhanced our summarization system with knowledge based measures. Wikipedia based extraction methods and topic modelling are used to score sentences in guided summarization track. For multilingual summarization task, we investigated the HA...
متن کاملIIIT Hyderabad in Guided Summarization and Knowledge Base Population
In this report, we present details about the participation of IIIT Hyderabad in Guided Summarization and Knowledge Base Population tracks at TAC 2010. This year, we enhanced our summaization system with knowledge based measures and utilized domain and sentence tag models to score sentences to suit guided summarization track. We have used an external tool, WikiMiner to identify key concepts in t...
متن کاملIIIT Hyderabad at TAC 2009
In this paper, we report our participation in Update Summarization, Knowledge Base Population and Recognizing Textual Entailment at TAC 2009. This year, we enhanced our basic summaization system with support vector regression to better estimate the combined affect of different features in ranking. A Novelty measure is devised to effectively capture relevance and novelty of a term. For Knowledge...
متن کاملIIIT Hyderabad at TAC 2012
In this paper, we report our participation in Knowledge Base Population at TAC 2012. We adopted an Information Retrieval based approach for the Entity Linking and Slot Filling tasks. In Entity Linking we identify potential nodes from the Knowledge Base and then identify the mapping node using tf-idf similarity. We achieved very good performance in the Entity Linking task. For Slot Filling task ...
متن کاملCross Lingual Information Access System for Indian Languages
The CLIA (Cross Lingual Information Access) Project is a mission mode project funded by Government of India, Ministry of Communications & Information Technology, Department of Information Technology vide its approval No. 14(5)/2006 – HCC (TDIL), Dated 29-08-2006. It is being executed by a consortium of 11 academic and research institutions and industry partners, IIT Bombay, IIT Kharagpur, IIIT ...
متن کامل